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Abstract — The index coding problem is a fundamental trans- 
mission problem which occurs in a wide range of multicast 
networks. Network coding over a large finite field size has been 
shown to be a theoretically efficient solution to the index coding 
problem. However the high computational complexity of packet 
encoding and decoding over a large finite field size, and its subse- 
quent penalty on encoding and decoding throughput and higher 
energy cost makes it unsuitable for practical implementation 
in processor and energy constraint devices like mobile phones 
and wireless sensors. While network coding over GF(2) can 
alleviate these concerns, it comes at a tradeoff cost of degrading 
throughput performance. To address this tradeoff, we propose 
a throughput optimal triangular network coding scheme over 
GF(2). We show that such a coding scheme can supply unlimited 
number of innovative packets and the decoding involves the 
simple back substitution. Such a coding scheme provides an 
efficient solution to the index coding problem and its lower 
computation and energy cost makes it suitable for practical 
implementation on devices with limited processing and energy 
capacity. 

I. Introduction 

The index coding problem Q), (2) is an instance of packets 
P = {ci, C2, ■ ■ ■ , cm} transmission problem over a noise- 
less channel by a transmitter to multiple receivers R = 
{Ri, i?2, Rn}, given the subset of packets each receiver 
already has H(Ri) C P, and the disjoint subset of packets 
each receiver wants W(Ri) C P, such that the total number of 
transmissions is minimized, as shown in Fig. Q] It is intuitive 
to see that the index coding problem can also be extended 
for erasure channel, where the transmitter solves the index 
coding problem multiple times until all the receivers have 
received the packets in their want set. An efficient solution 
to the index coding problem has several applications, for 
instance the index coding problem has been shown to occur in 
content distribution networks [2 |, satellite communication net- 
works 0, wireless routing |3| and wireless multicasting |4j, 
0, amongst many other. Network coding 0, ||4] has been 
proposed as an efficient solution for the index coding problem. 
In network coding, the transmitter generates a coded packet 
by linearly mapping packets c m G P, with coefficients g m , 
from a finite field GF(2 q ), q 6 Ni, which is then transmitted 
to the receivers. As we will show later, minimizing the total 
number of transmissions requires that the transmitted coded 
packet is linearly independent from the set of packets H(Ri) 
for the maximum possible number of receivers, ideally for all 
unsaturated receivers. 
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Fig. 1 . Application of the index coding problem. Wireless router multicasting 
a file stream to N receivers. 



To illustrate how network coding can be beneficially appli- 
cable for the index coding problem, consider for example a 
wireless network where Rq is multicasting packets c\ and c-i 
to R\ and i?2- However, R\ receives c\ but not c%, whereas Ri 
receives C2 but not c\. In this case, rather than retransmitting 
packet c\ and C2 in 2 different time slots, it is possible for 
the transmitter to encode the packets c\ ©C2 over GF(2), and 
transmit the encoded packet in 1 time slot. On receiving the 
encoded packet both the receiver can recover the lost packet 
by decoding the original packet with the encoded packet. This 
therefore reduces the total number of transmissions and hence 
improves the network bandwidth. 

However the bandwidth performance of network coding for 
a class of multicast network with a packet batch size of M, 
being transmitted to N receivers, and a Bernoulli packet loss 
probability of pi for receiver Ri, has been shown to vary 
with the size of the finite field GF(2 q ) over which packet 
coding is performed 0,0. For random linear network coding 
(RLNC), where the encoding coefficient is randomly selected 
from GF(2 q ), the probability that the encoded packet y a is 
linearly independent of all the previously received packets 
increases logarithmically with respect to q and is bounded by 
an asymptote [4]. For a deterministic linear network coding 
(DLNC) scheme where the encoding coefficient g m , is selected 
deterministically by a polynomial-time algorithm, to guarantee 
that an innovative packet] is transmitted at every transmission, 

'An innovative packet with respect to a specified receiver, is defined as a 
packet which the receiver can't generate from the set of packets it already 
has. 



it has been theoretically shown that the size of the finite 
field from which the encoding coefficients g m , are selected is 
bounded as q > [log 2 Af| 0, 0. Clearly network coding over 
GF{2) can only guarantee an optimal solution for a network 
with 2 receivers. It has also been shown that the solution for 
the index coding problem using network coding over GF(2) is 
a NP-complete problem 0, Q. In network coding literature, 
encoding over GF(2) is referred as XOR coding, whereas 
encoding over GF(2 q ) for q > 1 is referred to as linear 
network coding (LNC), we will henceforth use these terms to 
distinguish between the finite field size from which the coding 
coefficients are selected. 

While LNC over a large field size GF(2 q ) has been 
proposed as a theoretically viable solution to the index cod- 
ing problem, such benefits do not come without a tradeoff. 
The most significant drawback of using LNC is the high 
computational cost of performing packet encoding and de- 
coding (9J. Unlike XOR coding and decoding, where the 
main mathematical operation is XOR addition, LNC coding 
involves multiplication and addition whereas LNC decoding 
involves multiplication and Gaussian elimination. LNC has 
packet encoding complexity of 0{MB) J6], where B is 
the length of the data packet. For RLNC the complexity 
of generating M random coefficients is given as 0(MG), 
where G is the constant complexity of generating a random 
number 0. For DLNC the complexity of deterministically 
generating M coding coefficients is given as 0(N 2 M 3 ) 0. 
The decoding computational complexity of decoding M LNC 
encoded packets is given as 0(M 2 B+M 3 ) for both RLNC 
and DLNC, and hence the decoding complexity per coded 
packet is given as 0(MB + M 2 ). 

The high energy cost arising because of LNC computational 
complexity makes LNC unsuitable for practical implementa- 
tion in battery constrained devices like mobile phones and 
wireless sensors to solve the index coding problem. Mobile 
phone batteries, like wireless sensor, suffer from severe energy 
limitation, and energy optimization for various smartphone 
applications is becoming an important parameter for designing 
smartphone applications which can sustain a longer battery 
life ifTOl . ATI . Using higher capacity battery can't be regarded 
as a viable alternate approach, as higher power consumption 
results in the overheating of the devices. Experimental evalua- 
tion of RLNC over GF(2 8 ) for iPod Touch 2G has shown 
that packet encoding and decoding can account for up to 
33% of the battery's energy consumption lfl2l . Whereas it 
has been shown that XOR-encoding of 2 packets, each 1000 
bytes long only consumes 191 nJ [13] of energy. Given that 
transmission of a packet of the same length over IEEE 802.11 
network on Nokia N95 consumes 2.31 mJ iTOl of energy, 
the overall energy cost of XOR-coding has no apparent effect 
on the total energy cost of encoding and transmitting a XOR 
coded packet. With energy efficiency becoming an increasingly 
important concern for communication networks ifTTI . even if 
the solution for the index coding problem using LNC is to be 
limited for desktop computers and access points (AP), which 
can afford high energy cost, such an approach would not be 



green communication feasible because the higher energy cost 
of LNC corresponds to a higher carbon footprint penalty. 

LNC also suffers from low encoding and decoding through- 
put. The computational penalty cost of LNC on encoding and 
decoding throughput has also been practically demonstrated 
on a testbed [14]. In this work the authors show that in 
general, encoding over GF(2) is approximately 8 times faster 
than encoding over GF(2 8 ) on iPhone 3G implementation. 
Similarly decoding over GF(2) is approximately 6 times faster 
than decoding over GF(2 & ) on the same testbed. 

From the previous discussion it is apparent that a viable 
solution to the index coding problem is to use a coding 
scheme which can deliver the bandwidth performance of LNC, 
while affording the computation cost of XOR coding. To 
address this gap, in this paper we propose a triangular pattern 
based packet coding scheme, where packets are encoded 
over GF(2), and decoding is done using the simple back 
substitution scheme rather than Gaussian elimination method. 
Further such coding scheme can guarantee enough 'pool' on 
linearly independent coded packet to deliver optimal through- 
put performance. Such optimization tradeoff between energy 
cost and bandwidth performance has also garnered interest 
recently J9) . In addition unlike deterministic coding algorithms 
such as the traditional XOR-coding schemes [3|, [15] and 
DLNC polynomial-time algorithm (J5j which require packet 
feedback information for every transmitted packet from all 
the receivers, the performance of our proposed coding scheme 
like RLNC is independent from the constraint of having packet 
feedback information. 

Our paper is organized as follow. We first highlight the cur- 
rent research directions to reduce the energy cost of LNC and 
increase its encoding and decoding throughput in Section [EI] A 
formal problem formulation and system parameters are stated 
in Section [HI] We then propose our proposed coding scheme 
in Section [TV] and the evaluation of the triangular coding 
throughput performance, packet overhead and computational 
complexity along with comparison with current network cod- 
ing schemes in Section [V] Numerical results of the packet 
overhead of our coding scheme and its comparison with the 
packet overhead of other network coding scheme are given in 
Section [VI] Finally we conclude with the main contributions 
and results of our paper in Section IVIII 

II. Related Work 

Current research direction to reduce the computation cost 
of LNC, and increase the encoding and decoding throughput 
falls under three broad categories. The first is the use of 
Gauss-Jordan elimination method running on parallel multi- 
processors system as shown in [16], fl2l . IfTTI to increase 
the decoding throughput. This is based on the well-known 
computer science principle that even though the Gauss-Jordan 
elimination method requires more computation steps relative 
to Gaussian elimination, Gauss-Jordan elimination method can 
nonetheless speedup the processing time required to solve a 
matrix of fixed size as the number of processors increases [ 18 1. 



This is explained due to the better load balancing charac- 
teristics and lower synchronization cost of the Gauss-Jordan 
elimination method. However such method comes at the trade- 
off cost of increasing number of processors requirement, and 
higher energy cost. Even though both Gaussian elimination 
and Gauss-Jordan elimination have the same computational 
complexity order, Gauss-Jordan elimination requires more 
number of computation steps. A parallel Gauss-Jordan algo- 
rithm for multi-processor system is shown to require approxi- 
mately 50% more operations than Gaussian elimination IfTSll . 

The second major research direction to reduce the decoding 
computation cost of LNC is to use sparse coding coefficients. 
A sparse matrix can loosely be described as a matrix with 
more number of '0's. In |7], [ 19 1 it has been shown that when 
the size of the coding field is bounded as q > [log 2 N~\ , there 
always exists a set of coding coefficient such that the coded 
packet is linearly independent from the set of packets each of 
the N receivers have. However, the SPARSITY problem |[T9l 
of finding such set of M coding coefficients of which M — u> 
coding coefficients are '0' and the coding coefficient is linearly 
independent from the set of packets received by all the 
receivers, is an NP-complete problem with respect to N. By 
assuming fixed N, the computational complexity of solving a 
M x ill matrix of rank M by the receiver reduces from 0(M 3 ) 
to 0(M 2 oj), where cj < M. Unfortunately such reduction in 
the complexity of Gaussian elimination come at the tradeoff 
cost of solving the SPARSITY problem by the transmitter 
which has complexity given as 0(M N (MN 2 )) fl9l . 

The third major approach to reduce the computation cost is 
to use the trivial approach of using smaller packet batch size 
(see [20] and references therein). However such an approach 
of decreasing the packet batch size comes at the tradeoff cost 
of decreasing throughput performance 1151 . H, (2fl . 

III. Problem Statement 

Similar to 0, |8), lfl5ll we consider the problem of a 
wireless transmitter Rq multicasting M packets to N receivers. 
M is called the packet batch size. The iid Bernoulli packet 
loss at each receiver Ri is given as pi. At the start of 
the transmission, W(Ri) = P,Vi. After each transmission, 
every receiver updates it want and has set, based on the 
packet it has received. i?o continues transmitting packets until 
\H(Ri)\ — \P\, Vi. Therefore our considered problem of 
multicasting M packets to N receivers over an erasure channel 
is a general case of the index coding problem. Let T denote the 
number of transmissions necessary before all the N receivers 
receive M innovative packets. The problem statement which 
we are interested to solve can be written as an optimization 
problem 
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In this paper we propose a triangular packet coding scheme 
which solves (Q~|) optimally, with encoding and decoding com- 
putational complexity of the same order as that of traditional 
XOR packet encoding and decoding. 

Therefore the main contribution of our paper is twofold, 
first we address the energy cost and bandwidth performance 
tradeoff associated with network coding [9| in this paper. 
Secondly, from the information theory perspective, we show 
that unlike previous works which have concluded that the 
optimal solution for the index coding problem using over 
GF{2) is NP-complete Q, 0, 0, fl2), it is possible to 
obtain an optimal solution for wireless multicasting using 
XOR coding when the constraint of adding redundant bits to 
packets is relaxed. 

A. Performance Bound 

The main performance measure for this work is the number 
of transmissions required to transmit M innovative packets 
to N memory-based receivers. The performance bound can 
be derived when an optimal network coding scheme is con- 
sidered, where every packet transmission is an innovative 
transmission. Similar to 0, ll22l . when packet reception is 
characterized by the binomial probability law, with nonho- 
mogeneous packet loss probabilities, the average number of 
transmissions required to transmit M innovative packets to N 
memory-based receivers is 
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where pj is the packet loss probability for receiver Rj . 

We may approximate the above result by only including 
those whose packet loss are high. With this approximation, 
the above result can be rewritten as 
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where p = max{pi} and k is the number of receivers with 
packet loss probability pi = p. 

Axiom 1: For a network with specified M, k and p any net- 
work coding scheme which can transmit Q(p,k, M) innovative 
packets in Q(p, k, M) transmissions is considered an optimal 
coding scheme. 

IV. Proposed Coding Scheme 

We first illustrate the practical usefulness of our proposed 
coding scheme by the aid of a simple example. Consider the 
case of Rq multicasting packets c\ and C2 to 4 receivers, 
i?i,...,i?4. After the first 2 transmissions, only R\ receives 
ci and only R2 receives 02- Now to transmit a packet which 
is innovative for all the receivers, the only possibility is to 
transmit c\ © C2, when coding is limited to GF(2). However 
let us assume that only i?3 receives this coded packet. Given 
the constraint of coding over GF(2) it is easy to verify that 
after these 3 transmissions, there is no possibility to transmit 
a packet which will be innovative for all the receivers. This 



is also consistent with previous theoretical analysis (5), 0, 
which have proven that an innovative packet transmission, for 
every transmission is only possible on the condition that the 
field size is larger than or equal to the number of users. 

In our proposed coding scheme we go around this infor- 
mation theory limitation, by adding redundant bits to c\ and 
C2, then encoding these packets over GF(2), and including 
information about these redundant bits added to each packet 
in the packet header of the encoded packet. For the given 
example, we add bit '0' at the head of data payload of packet 
c\. To equalize the length of both the packets we add bit '0' at 
the tail of the data payload of packet C2, and then encode these 
packet over GF(2). Let us assume that all packets c m e P, 
1 < m < M, have equal data payload length of B bits. The 
binary bit pattern of the data payload for packet c m can be 
represented as (bi >m , 6 2 , m , bB.m),bj^ m £ {0, 1}. Hence the 
bit pattern of ci with one redundant bit added at the head 
of the packet is given as (0, bii, &b,i), and that of C2 as 
(&i,2, •••) &B,2> 0). We denote such modified packet as c m ^ m , 
where r m £ No, is the number of '0's added at the head of the 
data payload of packet c rn . The new encoded packet is denoted 
as 01,1 ©C2,o- The packet header will include information about 
r m for each packet used for encoding, and we will study the 
overhead cost in subsequent section. 

This new encoded packet ci.i © C2,o will be innovative for 
all the 4 receivers. If R3 receives this packet, it will have the 
information about bit bx 2 as bit '0' was added as a redundant 
bit in packet c\ from the packet header. Since R3 also has 
packet Ci ©C2, it now has information about bit 612 and 611 © 
bi y 2- Using this information it can decode bit 61.1 ■ Bit 611 
is then substituted in b\,\ © 62.2, from the encoded packet 
Ci,i © C2.0, to obtain bit 6 2 ,2- Therefore using this bit-by-bit 
simple back substitution method, R 3 can decode all the bits 
of packet c x and c-i- 

A. Generation of Innovative Packets 

For a system such the total number of redundant bits 
added is r max to a packet of length B bits, then we note 
that generating c my r m by adding of redundant bits to c m is 
equivalent to the following operation 

An encoded packet ci <ri © C2, r2 © • ■ • © cm,t m i s sa id t0 
be innovative to a receiver if the encoded packet is linearly 
independent with respect to all other encoded packets that 
the receiver already possesses. Our aim is find a sequence of 
coefficient sets such that an encoded packet with a coefficient 
set is always linearly independent to encoded packets of any 
collection of other coefficient sets, and secondly, the decoding 
procedure can be solved using the back substitution method. 

Assuming encoding of M packets, for an encoded packet 
Va = ci,n © c 2 ,r 2 © ■•■ © CM,r M ' tne tota l number 
of redundant bits added to each packet c m is r max = 
max{ri, r2, tm}- We represent the unique id of encoded 
packet y a as (n, I'm) a- The encoded packet can be 



written as 

We shall propose a natural number based triangular substitu- 
tion based network coding. The Gaussian elimination method 
consists of 2 major steps, the matrix triangularization step 
and the back substitution step. The matrix triangularization 
step accounts for 0(M 3 ) steps, while the back substitution 
step requires only 0(M 2 ) steps [23]. Therefore our aim is to 
design a coding scheme such that the receiver does not have 
to perform the triangularization step. In other words, with our 
proposed coding scheme, the encoding packets are linearly 
independent and back substitution ready. 

If packet encoding is performed such that for an encoded 
packet, y a with an id of (r%, f2, rjvf)o> r m is mapped to 
an element of the set of natural number sequence given as 
{0, 1, (M — 1)} using a bijective function, then a collection 
of encoded packets can easily form a triangular augmented 
matrix. Without loss of generality, assume that the first en- 
coded packet (also called the initial pattern) has an id given 
as (0, 1, 2, 3, M — l)i. Then to generate the next encoded 
packet, we anchor '0' at its position and rotate all other 
terms rightward which results in (0, M — 1, 1, 2, M — 2) 2 . 
Similarly the consecutive encoded packets in the series are 
generated as (0, M — 2, M - 1, 1, M -3) 3 , (0,M-3,M- 
2, M - 1, M - 4)4, (0, 2, 3, 4, M - 1, \) M -\- We 
call all encoded packets in this series, with the '0' anchored 
in one position, to form a group. We may now anchor '0' at 
the second position starting with an id (1, 0, 2, 3, M — 1)m 
for the next group. By rotating all other terms except '0', 
we get another group of M — 1 encoded packets. Given M 
positions for '0' to anchor, we yield altogether M groups of 
M — 1 encoded packets with the very last encoded packet 
being (2,3,4,..,M-l,l,0) M(M -i). 

But what if the transmitter needs to generate additional en- 
coded packets? In such case, after the M (M — 1) permutations 
has been exhausted, the transmitter may start with another 
initial pattern of (0, 2, 4, 6, 2M — 2)jv/(a/-i)+i which is 
derived from the earlier initial pattern with each r m multiplied 
by a constant a. Let a = 2, and we call this collection of 
encoded packets the second round. Similarly, the transmitter 
can generate another M(M — 1) encoded packets for the third 
round by setting a = 3. Theoretically, we may continue with 
a = 3,4, 5, ... without a limit, only that a higher value of a 
results in a larger value of r max which implies more redundant 
bits for the encoding. 

In the following, we shall show that our proposed coding 
scheme always generate innovative packets. In other words, the 
coefficient matrix of any given n encoded packets where n < 
M gives a rank of n. While our proposed coding scheme mixes 
standard arithmetic (i.e. multiplication by shifting) and finite 
field arithmetic (i.e. exclusive-or), use of standard arithmetic 
is sufficient to prove that every generated packet is innovative. 

Lemma 1: Consider a system of M packets. Given an 
encoded packet y a with an id of (0, 1, 2, 3, M — l) a , the 



coefficient matrix formed by all M — 1 encoded packets in the 
same group of y a including y a gives a rank of M — 1. 

Proof: We first list all the ids of M — 1 encoded packets 
in the same group of y a including y a as follows where 
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From the id of y a , we know that r max = M — 1 and a 
The above list of encoded packets can be expressed by 
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For each pair of consecutive equations above, we multiply 
the upper one by 2 a and then minus with the lower one. After 
this simplification, we get 



which clearly gives a rank of M — 1. a 

For other groups, we may reorder the coefficients such that 
r\ = and then repeat Lemma Q] to show that other groups 
possess the same property. Furthermore, Lemma Q] can be 
easily extended to the case of a = 2, 3, ... which applies to a 
group belonging to higher rounds. 

Lemma 2: Consider a system of M packets. A collection of 
all M — 1 encoded packets in the same group and an encoded 
packet from a different group gives a rank of M. 

Proof: We shall prove by calculating the determinant of 
the matrix. Without loss of generality, we consider a group, 
G, of encoded packets starting with the initial pattern of 
(0, 1, 2, 3, M— Based on Lemma[U after simplification, 
these encoded packets can be represented by 
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Let us consider an arbitrary encoded packet, y a , with the 
following representation 

Va = Aici + A 2 c 2 + A3C3 H h XmCm- 



Combining the last two results, we form a coefficient matrix 
and express its determinant, d as 
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The condition for y a to be linearly independent of the 
encoded packets from group G is that d produces a none zero 
value. After some algebraic manipulation on d and applying 
the condition where d ^ 0, we yield the following condition 
where 
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coding scheme demands 
ping between {ri, r 2 , rjf} and 
other words, {Ai, A 2 , Am} are 
| 2 (M-i) ;2 (M-2) ; ... ; 2°}. The only case that © cannot be 
met occurs when Ai = 2^ M_1 \ In other cases, we always 
have 
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and since 2 M ~ 
met. 

The setting of Ai =^ 2( M_1 ) requires that r\ ^ for y a of 
id (ri, r 2 , rn). As y a is drawn from a group other than 
group G, its '0' is anchored at a different position rather than 
at r\ like those in group G, thus we have t\ ^ 0, and hence 
y a is linearly independent of the encoded packets from group 
G. ■ 

Applying a quick test using (O in Lemma shows that 
based on our coding scheme, a pick of an encoded packet 
even from a group of other rounds satisfies the condition. 
Consequently, with the above lemmas, we established that 
every encoded packet is an innovative packet. 

B. Packet Decoding 

We now show that such a coding scheme can be solved 
by substitution method. We illustrate the concept using an 
example depicted in Fig. [2] In our example, we consider a 
set of 4 original packets, that is M = 4. After transmissions 
of several encoded packets, we consider that a receiver has 
successfully collected 4 encoded packets, y\, y 2 , 7/3 and 2/4 
with the packet ids of (0, 1, 2, 3)i, (1, 0, 2, 3) 2 , (3, 0, 1, 2) 3 and 
(1, 2, 3, 0)4 respectively. In Figure [2] each table represents an 
encoded packet where each row lists the bits of an original 
packet involved in the encoding. It can be seen that the first 
bit of yx counting from the left is encoded by &i ; i ® © © 
which equals 6i i i. Similarly, b\ j2 and can be obtained 
from the first bits of j/ 2 and 7/4 respectively. Now, the decoder 
may proceed to the second bit position of the 4 packets. 
Substituting 611, 61. 2 and 6^4 into the [M — 1) matrices, 62,1, 
&2,2, &2,4 and 61.3 can be obtained immediately. Moving on to 
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Fig. 2. An illustrating example to show how packet decoding is done 
on triangular network coding scheme. Highlighted bj m represent calculated 
bit value from an equation with only 1 unknown variable, whereas non- 
highlighted bj m represent substituted bit value. Bits '0' are known bit values 
from the packet's header. Packet ?/2 and yz are from the same group. 

the bit position, bits 63 x, ^3,2, ^2,3 and 63,4 can be immediately 
solved by substitution. The process can continue further until 
all unknown bits are solved. By this way, a receiver can decode 
the bits of all 4 packets through the simple back substitution 
at the bit level. 

V. Performance Evaluation 

A. Throughput Performance 

Based on Axiom Q] if for a network with specified M, 
k and p, our proposed coding scheme can generate at least 
Q(p, k, M) innovative packets, then it can be considered as an 
optimal coding scheme. It is easy to verify that for 

a(M 2 - M) > Q[p, k, M) (4) 

there always 3a such that the above inequality is satisfied. 
As we will show later in Section [VI] for most practical 
setting (generally corresponding to M > 10) a = 1 is 
sufficient to guarantee at least Q(p, k, M) innovative packets. 
For smaller M, a small value of a is sufficient, and therefore 
our proposed coding scheme does not have significantly high 
packet overhead. In practice, a very small value of M is often 
not desirable as it does not provide sufficient pool of packets 
to utilize the benefit of network coding. We formally study 
the packet overhead cost of our proposed coding scheme in 
subsequent subsection. 



The throughput performance of DLNC is optimal when 
Q > riog 2 jV] . However in the paper [51, the optimal bound 
and DGC (Dynamic General Codings throughput perfor- 
mance results do not match. This is explained as follow. The 
throughput bound the authors consider is not tight, however 
since the DGC algorithm claims to be able to always find 
coding coefficients which is linearly independent for all the 
receivers, implies that the DGC is an optimal coding scheme. 

Assuming fixed N, sparse LNC can be solved in polyno- 
mial time to generate sparse and linearly independent coding 
coefficients, otherwise optimal throughput for sparse LNC is 
NP-complete. However since solving SPARSITY for a fixed 
network size is a special case of the SPARSITY problem, 
therefore in the general sense the throughput optimality of 
sparse LNC is considered NP-complete. 

B. Packet Overhead 

For LNC, the packet header needs to include information 
about encoding coefficient g m used to multiply with c m . If 
g m £ GF(2 q ), the number of bits required to represent g m is 
given as log 2 2 9 . Since there are M such encoding coefficients, 
the total packet overhead of RLNC is given as M log 2 2 9 , 
which reduces to Mq bits. For an optimal DLNC and sparse 
coding, q is bounded by q > [log 2 N~\ (5), 1191 . therefore the 
total packet overhead for an optimal DLNC and sparse coding 
scheme is given as M |~log 2 AT] . 

For our proposed coding scheme, the packet overhead is 
given by r max redundant bits added to every packet, and 
M\\og 2 r max \ bits to store the encoded packet unique id 
(rx, rM) a in the encoded packet header. Therefore the total 
packet overhead of our proposed coding scheme is given 
as r max + M\\og 2 r max \ bits, where r max = a(M - 1), 
and a corresponds to the smallest value required to satisfy 
inequality |@J. For a = 1 (which satisfies inequality 
when M is reasonably large), the total packet overhead is 
M - 1 + Mflog 2 (M - 1)] or simply M + M |~log 2 M] if we 
ignore the insignificant constant term of minus one. 

We would like to further point out that the packet overhead 
of triangular network coding can be reduced to approximately 
M bits, which is the number of redundant bits added to each 
packet. The only information which needs to be included in 
the packet header is the packet batch size, the group index 
and packet index. Since triangular network coded packets' id 
follow a sequence, with these information in the packet header, 
the receiver can reconstruct the packet id of the coded packet. 

However for our coding comparison we assume the packet 
overhead of M + M [log 2 M ] for triangular network coding 
to provide fair comparison with RLNC packet overhead, since 
assuming that both the transmitter and receiver have the same 
random number generator (RNG), all what a RLNC coded 
packet then needs to include in the packet header is the seed 
and packet index. Using the seed and packet index, the receiver 
can then regenerate the random coding coefficients. 

2 DGC is the name of a DLNC based algorithm demonstrated in 0. 



C. Computational Complexity 

We evaluate three distinct computational complexity. The 
algorithm complexity refers to the complexity of generating the 
coding coefficients for M coded packets. The total algorithm 
complexity is given as M times the complexity of finding M 
coding coefficient for each coded packet. The encoding com- 
plexity refers to the complexity of encoding M coded packets. 
Whereas the decoding complexity signifies the complexity of 
decoding M innovative coded packets by a receiver. 

We evaluate the computational complexity for M coded 
packets for uniformity in comparison, though in practise 
the total algorithm and encoding complexity would also be 
dependent on the throughput optimality of the coding scheme. 
Optimal coding scheme will need to run the algorithm and 
perform coding Q(p, k, M) times only, whereas suboptimal 
coding schemes will need to run the algorithm and perform 
coding Q(p, k, M) + C times, where C is a coding scheme 
dependent arbitrary constant. 

The decoding complexity is more significant than the algo- 
rithm and encoding complexity. This is because the algorithm 
and encoding computation cost is borne by a single transmitter 
which is often not a battery constrained device such as an AP, 
whereas the decoding cost is borne by N receivers, which are 
often battery and processor constrained such as smartphones. 

We evaluate the encoding and decoding complexity at 
bit level, so that this way we can distinguish between 
the complexity of multiplication used in LNC and addition 
used in XOR coding. For two non-negative integers a and 
b, the multiplication and addition complexity is given as 
0((log 2 a)(log 2 b)) and 0(log 2 a + log 2 b) respectively |24|. 
While there exist different optimization algorithms to reduce 
the computational complexity of multiplication, such optimiza- 
tion survey is beyond the scope of the paper. 

1) Traditional XOR: The encoding complexity of a tra- 
ditional XOR packet encoding of m packet^ is given as 
(D(mB), where m < M. XOR encoding is performed bit- 
by-bit. For each bit (6j, m ) position it needs to evaluate the 
value of m bits in location j. Therefore the average compu- 
tational complexity to generate M coded packets is given as 
O(MmB). Similarly XOR-encoded packet can be decoded by 
performing the XOR operation on a set of packets, which has 
complexity given as 0(mB). To decode M encoded packets, 
the average computational complexity is given as O(AlmB). 

For the algorithm complexity we consider the sort-by- 
utility algorithm [ 15 1 for our evaluation, which has complexity 
given as 0(M 2 log 2 M). We have chosen the sort-by-utility 
algorithm, as it has been shown to be an efficient XOR coding 
algorithm with respect to other known XOR coding algorithms 
in ifDl . 

2 ) RLNC: RLNC needs to perform random number genera- 
tion, multiplication and addition during encoding. Generating a 
random number has constant time complexity of G. Therefore 

3 XOR coding is often performed on a subset of packets from P fj), 1 151 , 
where \P\ = M, the cardinality of the subset of packets over which encoding 
is performed varies for each encoded packet and is algorithm dependent. 



the total algorithm complexity is given as 0(M 2 G). The 
operation g m 'C m has complexity given as 0(qB). To generate 
a coded packet M such operations needs to be performed, 
hence the complexity to generate a coded packet is given as 
O(MqB). Therefore the total encoding complexity to generate 
M coded packets is given as 0(M 2 qB). 

For decoding, RLNC needs to perform Gaussian elimination 
with complexity given as 0(M 3 ), and then multiplication and 
addition of M coded packet with the inverse matrix. Therefore 
the total decoding complexity is given as 0(M 2 qB + M 3 ). 

3) DLNC: The only known DLNC algorithm is the DGC 
algorithm given in [5]. DGC has algorithm complexity given 
as 0(N 2 M 3 ) for one coded packet. Its encoding and decoding 
complexity is the same as that of RLNC. 

4) Sparse LNC: Sparse LNC is a special case of DLNC. 
Unlike DLNC, where the main objective is to find coding co- 
efficients which are linearly independent from received coding 
coefficients by all the receivers, in sparse LNC, the objective 
is not only to find linearly independent coding coefficients, but 
also coefficients which are sparse. The algorithm complexity 
for sparse LNC is given as 0{M N (MN 2 )), which reduces 
to 0(M N N 2 ). Since there are large number of '0' coding 
coefficients in sparse LNC we can ignore the computational 
cost of g m ■ c m when g m = 0. Hence the coding coefficient 
is given as O(MqujB), where ui is the number of non-zero 
components, w < M. The decoding complexity is given as 
0(M 2 qB + ujM 2 ). 

5) Triangular Network Coding: The number of flops (or 
number of steps) for our proposed triangular network coding 
scheme is given as follow. Generation of unique packet ids for 
packet encoding has constant time complexity, as these packet 
id follow a natural number based sequence. The algorithm to 
generate the coefficient matrix for M packets has complexity 
of 0(M 2 ). For encoding complexity, the encoding of each 
packet has complexity of O(MB), since there are B bits from 
M packets, and XOR addition is the only required operation. 
The redundant '0' bits can be ignored both during encoding 
and decoding. Therefore to generate M such encoded packet, 
the overall complexity is given by 0(M 2 B). 

Once a receiver has M innovative encoded packets, then the 
complexity to decode these M packets is given as 0(M 2 B). 
We elaborate on the back-substitution process. M flops are 
required to find the solution of each bit (XOR addition), the 
solution for this equation is then substituted in A/ — 1 matrices. 
Since there are M such equations the total number of flops 
is given as M(2M — 1). Hence the decoding computational 
complexity for each bit location is given as 0(M 2 ), which is 
also consistent with the results shown in ||231 . 

Our proposed coding scheme results in triangular pattern 
both at the start and end of the packet (See Figure [2] for 
illustration), and therefore for a multiprocessor system, it is 
possible that one processor decodes the first half part of the 
packets from the start, while simultaneously another processor 
decodes the second half part from the end of the packet 
(reverse direction) using the substitution method. This can cut 
down the decoding time by half in a multiprocessor system. 
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Fig. 3. Packet overhead cost for N=10 and p=0.3. 

D. Packet Feedback Requirement 

We further note that traditional XOR, DLNC and sparse 
LNC require feedback information from all receivers to decide 
what coefficient sets to be used for further encoding of 
packets such that the encoded packets can be innovative to 
all receivers. In this aspect, RLNC and our proposed scheme 
do not need the feedback information. For RLNC, it simply 
generates encoded packets using random numbers. For our 
proposed scheme, the transmitter can simply continue the 
sequence of coefficient sets as the generated encoded packets 
will always be innovative. 

Collecting packet feedback from N receivers over an era- 
sure channel induces a very large overhead. For wireless 
networks, the IEEE 802. 1 1-2007 standard does not provide any 
MAC level reliability for multicast transmissions. Therefore 
deterministic network coding schemes are often based on 
an impractical arbitrary assumption that the transmitter has 
perfect feedback information for all transmitted packets. 

VI. Numerical Results 

We now study the packet overhead corresponding to the 
smallest a necessary to satisfy inequality (0J, and compare 
the results with the packet overhead of RLNC. Without loss 
of generality, we assume that p\ = ... = pn = P, such that 
k = N. Unlike the packet overhead of triangular coding, which 
is a monotonic function with respect to M, RLNC and DLNC 
packet overhead cost are a function of two variables. 

For RLNC there is no standardized field size suggested, 
though an overwhelming majority of works have considered a 
field size of GF(2 8 ) for RLNC as being sufficiently large 
enough to guarantee linear independency with very high 
probability 02), |9), |E), El- We therefore assume q = 8 
for the evaluation of RLNC packet overhead. 

The result of packet overhead for various coding scheme for 
different M, N, and p is given in Figs. [3] and [4] The results 
show that for practical setting, the packet overhead cost of our 
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Fig. 4. Packet overhead cost for N=100 and p=0.8. 

proposed coding scheme is lower than that of RLNC. For a 
large network (see Fig. @J, the packet overhead cost of DLNC 
is similar as that of triangular coding. Even for a large network 
with a high packet loss probability as shown in Fig. [4] the 
packet overhead cost of triangular network coding is the same 
as that of a smaller network with low packet loss probability. 
In fact the only difference between the results presented in 
Fig. [5] and Fig. |4] occurs at M — 5 for triangular network 
coding, when a = 2 is required to satisfy inequality @] which 
corresponds to a slightly higher packet overhead cost. 

The sudden increase of packet overhead cost for triangular 
substitution from M = 30 to M = 35 and M = 60 to M = 65 
is explained from the presence of a ceiling logarithm to base 
two term in the packet overhead cost of our proposed scheme. 

VII. Conclusion 

The characteristics of various coding schemes, including 
triangular coding are summarized in Table U In this work 
we have proposed an optimal triangular network coding 
scheme over GF{2), and demonstrated its throughput and 
computation cost merits over previously proposed network 
coding schemes. Unlike traditional linear network coding 
scheme which requires the use of multiplication and Gaussian 
elimination method at the encoder and decoder respectively, 
our proposed coding scheme uses XOR addition and back- 
substitution method at the encoder and decoder respectively. 
This way, rather than decreasing the number of computation 
steps, we decrease the order of computational complexity for 
both encoding and decoding, without compromising on the 
throughput performance. Such a coding scheme is throughput 
optimal, and its packet overhead increases linearithmic with 
respect to M, which can be optimized such that the packet 
overhead increases linearly with respect to M. Most impor- 
tantly, triangular network coding performance is independent 
of the packet feedback information. 



TABLE I 

Summary of the characteristics of various coding schemes. 





RLNC 


DLNC (DGC) |5| 


Sparse LNC 1 19] 


Traditional XOR 


Triangular Network Coding 


Throughput 


Suboptimal 


Optimal 


NP-complete 


NP-complete 


Optimal 


Algorithm Complexity 


0(M 2 G) 


C(Af 2 M 4 ) 


0(M" N 2 ) 


0{M 2 \og 2 M) 1 1 5 1 


0(M 2 ) 


Encoding Complexity 


0(M 2 qB) 


0{M 2 qB) 


O(MquiB) 


O(MmB) 


0(M 2 B) 


Decoding Complexity 


0{M*qB + M 6 ) 


Q{M*qB + M 3 ) 


0(M 2 qB + uiM 2 ) 


O(MmB) 


0(M 2 B) 


Packet Overhead (bits) 


Mq 


A/|log 2 JV| 


M\\og 2 N\ 


M 


M + M|log 2 M| 


Packet Feedback 


Not required 


Required 


Required 


Required 


Not required 



In addition, apart from the practical implementation signif- 
icance of our coding scheme, such results are also of interest 
from an information theoretic studies perspective, as unlike 
previous works, we have shown that it is possible to obtain 
optimal coding solution over GF(2) by adding few redundant 
bits in the packets. 

Such a coding scheme would in particular be of interest in 
energy and processor constraint devices. For our future work, 
we would like to study the feasibility of extending our current 
coding scheme for a distributed multi-hop wireless network. 
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